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ABSTRACT 



The need for secure voice communication systems is 



increasing both in the 
Coupled with this 



civil and military arenas, 
is the need for conserving 
bandwidth, increasing performance, and reducing costs. 
Currently used secure voice metnods are relatively 
antiquated and do not provide desired performance and 
bandwidth conservation without incurring increasing 
costs. A new system, proposed herein offers bandwidth 
reduction, increased performance, and decreasing costs 
while using modern digital techniques as opposed to 
analog techniques. The proposed system, known as 

VOCOM, operates in existing voice bandwidths using 
existing equipment, and offers a higher level of 
privacy and security while at the same time 
simplifying software handling. Additionally, the 
proposed system offers the user real-time operation to 
enhance critical decision-making. 
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• MIRQDuction 



The need for security in voice coinsiunicatioiis systems is 
sreadily increasing. Sensitive information is constantly 
being passed between people which, if intercepted by 
unintended or undesireable elements, could significantly 
affect the original purpose of the communication. As a 
consequence, the communicating parties stand to lose money, 
position, status or crucial elements of their own livelihood 
including national or personal security. Some protection 
methods are relatively antiquated and ineffective^ Compared 
to the current electronic ’’state of the art" for 
intercepting uncovered transmissions as well as covered 
transmissions , 

Voice security requirements reach into wide areas of 
business, civil and military communications. The need for 
voice security is obvious in credit, stockraar ket , and 
banking operations where information transferred by voice on 
uncovered lines demands confidentiality. These systems are 
vulnerable to attack by eavesdroppers and intelligence 
gatherers seeking to sabotage or threaten communications. 
Lav enforcement communication systems clearly need 
protection of voice communications. Although many of the 
police systems employ coding schemes, determined 
interceptors can, by patient association^ break the spoken 
codes . 

In the military voice communication applications^ 
elaborate daily- changing coding schemes have been developed 
as a sophisticated method of ensuring communications 
security. However, determined or hostile agents need only 
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II. 



VOICE CODING SYSIEI-iS 



Voice coding systems are of two general types: analog 
and digital. Digital systems convert voice signals directly 
into a number or digit stream, transmitting these bits in 
place of a voice signal. Existing digital systems require 
more than the nominal 3000 hertz bandv;idth available in most 
telephone applications. A digital system is therefore 
referred to as a wideband system. Wideband is defined as 
being several tiroes the unencoded base band signal width 
compared to a narrow band of approximately the same 
bandwidth £16]. The wide band virtually eliminates retrofit 
compatibility vfithout extensive rev/ork of existing 
communications systems. [21] Digital characteristics are 
well suited to systems employing pseudo-random encoding data 
streams. [8] Digital systems typically have 
analog- to-digital and digital-to-analog converters with 
coding and decoding of a digital data stream. 

A. ANALOG SYSTEMS 

Analog systems are used in voice security systems more 
extensively than digital systems. They are characterized by 
balanced mixers, oscillators and filters. The various types 
are discussed below. [8] 
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Inversion 



Inversion is' the naue for a scrambling process that 
provides security by systematic modification of a voice 
signal before transmission. In its simplest form it 
interchanges low voice frequencies and high voice 
frequencies. It operates by changing each frequency 
component present in a voice signal to a new frequency, 
where the new frequency is the difference between the 
original frequency and the reference or inversion frequency. 
For example, at a reference frequency of 3000 hertz, a voice 
component at 750 hertz would be converted to a component at 
3000-750 hertz or, 2250 hertz. A scrambled signal must be 
unscrambled at the receiving end using a second inverter of 
the same reference frequency. When tlie scrambled 2250 hertz 
frequency is subtracted from the reference frequency, 750 
hertz, the original frequency voice component is restored. 
The ease of unscrambling inverted speech makes this system 
vulnerable to unsecure transmission. An eavesdropper need 
only use an inverter with an adjustable reference frequency 
oscillator, tuning the oscillator until the speech is 
intelligible. Moreover, with concentrated attention, 
inverted speech can be learned directly in about four hours. 
Figure 1 depicts inversion. [8] 
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2 . 



Band- Split ti ng 



A more secure method of scrambling either divides 
the 300-^3000 hertz voice band into several subbands, or it 
inverts them, or both. This is known as band-splitting. 
Unscrambling is achieved by interchanging the signals in the 
subbands and reinverting them as required. Its advantage 
over the simple inversion technique is that many different 
code settings are possible according to how the different 
subbands are rearranged in the scambling process. Unlike the 
inversion technique, one cannot learn to directly understand 
the scrambled output of a band-splitter. However, by 
repeating a message several times, many of the words can be 
unscrambled by the huaian ear. It is also possible to 
eavesdrop by using equipment that returns just one of the 
subbands to its proper place thus rendering this method 
susceptible to relatively simple attacks. Figure 2 depicts 
band-splittirg. [8] 



12 




1000 2000 3000 Hertz 

(a) Original voice signal spectrum 




1000 2000 3000 Hertz 

(b) Scrambled voice signal spectrum; (note 
both displacement and inversion of the bands) 



Figure 2 - BAND-SPLITTING 
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3 . Ma skincj 



Security offered by inversion and band-splitting 
technigues can be enhanced by adding extraneous tones or 
ncise in a scrambler to mask the voice. These additions must 
be filtered out by an unscrambler . The difficulty of 
filtering is aggravated by the presence of harmonic 
distortion in transmission systems. Such distortion will 
generate noise and tones at new frequencies where they 
cannot be removed without also removing some of the voice 
signals. Consequently, masking increases the security of 
voice communication while reducing message intelligibility 
by the intended listener. 
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5 • Codes 

The ffethods discussed thus far were assumed to use 
fixed codes. Using a continually changing code sequence at 
different times increases the difficulty cf intelligible 
interception by an eavesdropper unless he had access to the 
specified cede sequence being used at a particular time. In 
a properly designed system enough different codes can be 
used so as to make it impractical for an eavesdropper to 
find the correct one. This method is enhanced even further 
if the code is changed as often as each day. This system 
requires frequent dissemination of particular codes to the 
users and security precautions to ensure that the codes do 
not fall intc unauthorized hands. 

B. DIGITAL TECHNIQUES 

The quality of digital transmissions is affected by the 
number of encoding levels in the fait stream. Tne larger the 
number of levels, the more bandwidth is required to transmit 
the signals. Bandwidth being a scarce commodity, an increase 
in bandwidth usage will result in an increase in cost, 
conventional digital voice security systems are 
characterized by by use of larger amounts of bandwidth 
(wideband) than analog voice security systems (narrowband) . 
They are consequently more expensive than analog systems. 
Nonetheless, digital systems have technical advantages, 
especially when sending vast amounts of data over long 
distances where storing of the data is required. Sending 
analog signals over long distsances requires expensive lines 
and high quality radio links. Long telephone lines can 
cause drastic attenuation in high frequencies. This effect 
can be overcome only by the use of costly coaxial or other 
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special cables. In contrast, even if transmission quality is 
poor, a digital system need only detect whether a "0" or a 
'‘1» was sent. It need not detect complicated and detailed 
waveforms of speech. Much poorer signals can be acceptably 
decoded and no noise or interference is present in the data 
sent; the output will be noise and interference free. 

Another method of telephone tranmission converts sounds 
into a stream of bits (pulse code modulation) and eliminates 
need for the costly modems required in analog systems. [16] 
The pulse code modulation (PCK) techniques being constructed 
equate to one channel becoming the equivalent of 56,000 bits 
per second in each direction. During an experiment in 
Europe by Eartin [16] a computer, a teletype machine, and a 
telephone line as the communication link took place. During 
one observation an analog voice line capable of transmitting 
4,800 bits per second was used. In a half hour it could 
transmit 1800 x 4,800 or 8,640,000 bits. In fact, it had 
only sent 21,000 bits of data. Its efficiency was 21,000/ 
8,640,000 or .0024, a poor use of an expensive facility. 
Voice lines use ECM techniques in which one telephone 
channel beccmes equivalent to 56,000 bits per second in each 
direction. This bit stream could conceivably transmit 1,800 
x 56,000 bits in a half hour in each direction. Using time 
sharing, the transmission efficiency could then be said to 
be. 21,000/2 x 1,800 x 56,000 or .0001. If we could push the 
efficiency up to .25 we would have an improvement of 2,500 
times. On an analog voice line used at 4,800 bits per 
second, a one hundred fold improvement would result. This 
can be accomplished through time sharing. [16] 

Analog storage is bulky and cumbersome, requiring tapes, 
and discs that are susceptible to damage loss, or 
misplacement. The use of digital storage techniques offers 
the user large amounts of storage capacity, fast access 
time, processing in real time, and minimal human handling* 
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A new system proposed here reduces PCM reguireraen 
56,000 bits per second to a mere 200 bits per secon 
increased advantage goes from an efficiency of 
example cited) -0024 to 3000 x 7/1,800 x 200 or .0583 
bit rate reduction is over 400:1. In general, a deer 
bandwidth decreases expense. Therefore, restricti 
bandwidth decreases expense. This bandwidth re 
technique can be applied to voice security systems. 
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III. PROPOSED VOICE CODING SYSTEM 



A. P.EASCH POR DIGITAL 



In normal speech there is a considerable redundancy of 
expression which means that more symbols are transmitted 
than are required to communicate the information. Parts of 
words and often whole words frequently are not required to 
communicate effectively. Naval messages typically are 
reduced to acronyms, abbreviations, and shortened words 
which are understandable zo the communicating parties. An 
example of redundancy is "ueue” in "queue". The "ueue" 
sound always follows the "g" and therefore the "ueue" is 
redundant. "The" is also frequently redundant. Most 
redundancy results from rules and limitations placed on 
languages, excluding usable combinations of letters. In a 
language permitting any permutation of four letters to be a 
word, such as "ngwv", then 456,976 words, or approximately 
the number of words in an unabridged dictionary, would 
exist. The English language prohibits a combination such as 
"ngwv", rendering it more redundant than the hypothetical 
four letter language. Limitations on vocabulary add to the 
wasre. A child's use of the word "play" may be changed by an 
adult to "frolic" or "amusement". It is more redundant for 
someone to "accomplish something" than it is for someone to 
"do something". According to Shannon, [24] rwo extremes of 
redundancy exist: one extreme is in use of additional whole 
words to add emphasis to an idea; the other extreme is in 
superfluous inflections and drawls of various dialects. 
"The basic English vocabulary is limited to 850 words and 
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the redundancy very high. This is reflected in the expansion 
that occurs where a passage is translated into basic 
English . '* 

Rules, liiaitations, formalities, and the desire to 
modify language and speech create redundancy. As a result, 
English is about 75% redundant which is to say only 25% of 
English text is necessary if it were wholly nonredundant . 
[ 10 ] 



In information theory entropy must be eliminated from a 
system. This is precisely what the proposed digital system 
does. By reducing the input data to a nonredundant level, 
mucli useless data is discarded while working parts are 
retained. As an example, data is transferred at the rate of 
60 kilobits fer second in Bell Telephone digital links with 
enormous redundancy since human speech conveys meanings at 
the rate of cnly a few hundred bits per minute. Speech as 
an audio signal is limited to a data rate of only several 
hundred bits per second. The proposed digital system 
transmits speech virtually without redundancy. Known as 
VOCOI-], (Voice CCIlmunication through COMpression and 
COMputation) it sends a series of digits that instruct a 
synthesizer tc recreate speech, instead of sending the 
original speech waveforms. Only several hundred bits, a 
fraction of the real amount, need be sent. [25] Because the 
human ear is sensitive to amplitude and frequency changes, a 
VOCOM processed signal will vary from the original yet still 
have sufficient quality to be intelligble. [17] Potential 
losses from data compression include recognition of who is 
speaking, transmission of the emotional content, and 
conversational effort. 



r 
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B. 



DESIGN OF THE VOCOM SYSTEM 



The design of a voice security system must be compatible 
with existing radio telephone equipment. It must also 
provide a reasonable amount of privacy against not only the 
"snooper'* on the EF channel but also against the loss or 
compromise of eguipmear. Finally, it must be reasonable in 
cost. The constraint of making the security system 
compatible with existing equipment restricts the 
communication channel to 300 to 3000 hertz. 



A digital computer has been designed to receive a 
continuous electrical signal and transform it into data at a 
comparatively slow rate for input into a general purpose 
processor. The machine uses digital circuits throughout to 
compute instantaneous values of frequency and power in real 
time. This machine is a special purpose digital computer 
capable of various modes of transformation through the use 
of Fourier transforms. This makes it easily applicable to 
the analysis of ’speech. It also allows hardware to be 
time-shared among several filters capable of examining 
components in any band of the audible spectrum. The digital 
filter bank, consisting of 128 filters, is capable of 
providing 10 octaves of data at semitone intervals to rhe 
digital oscillator bank, with 64 oscillators. Although it 
was designed for transmitting music, its potential is great 
in secure voice applications. (Contact for future 
reference; Mr. Alan Sutcliffe, Electronics Music Studios, 
277 Putney Bridge Road, London, SW152PT England.) 
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c. 



HARDWARE 



The "heart" of the system relies upon the PDP-8 
minicomputer manufactured by Digital Equipment Corporation 
(146 i“ain Street, Maynard, Massachusetts, 01754). It uses a 
12-bit word length intended for laboratory and process 
control applications with original system prices of 
approximately $28,500. [8] Originally not called a 
minicomputer, it rapidly became very popular and soon became 
recognized as the first mass-produced and popular 
minicomputer and the first computer to sell for less than 
$20,000 (CPU only) . A memory instruction can reference any 
of 128 aidresses on its own page, or any of 128 addresses on 
other pages. With indirect addressing, any location in 
memory can be referenced. These 128 addresses coincide with 
the 128 filters and 64 oscillators in the VOCOM system. 

The digital hardware consist of rwo PDP8 computers, a 
disc file, and a fast paper reader/punch with an attached 
magnetic tape drive. The input and output system makes it 
suitable for real-time applications. A crystal clock in tne 
interrupt line delivers synchronizing pulses at 400 hertz or 
a . sub-multiple of this frequency. There are also 10 
kilohertz digital- to-analog and analog-to-digital converters 
for visual purposes. £30] 

The computers control the pitch, timing, amplitude, and 
waveform through three banks The computers also control the 
gain and respose mode of 64 narrow passband filters placed 
at semitone intervals over five and one half octaves. Nine 
other oscillators and function generators, six amplifiers, 
two variable response filters, and a number of other devices 
such as noise generators are also controlled by the 
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computers. Host of the connections are done manually at a 
patch panel but up to twenty of them may be connected 
through computer controlled audio switches. [9] 

There must first be an analog signal from a telephone, 
radio or ether sound source. The signal is fed through a 
series of specialized filters and compressors to the 
analysis section of the VOCOH receiver srill in basic audio 
signal form. In the next step a digital analysis occurs 
where a number of parameters may be varied which determine 
the amount of digital data to be stored or transmitted. 

These varying parameters are rate analysis, normally 
20-30 times a second for speech; the numoer of points on the 
frequency spectrum to be sampled; and the precise 
frequencies for each of these poiars. Up to 64 individual 
frequencies (individually) can be analyzed ranging from 0-16 
kilohertz. For each of these points, up to 64 levels may be 
detected thus allowing a large amount of data to be absorbed 
by the receiver. [29] 
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understandable analog signal. This system is unique in that 
both the receiver and the transmitter are computers. It is 
not data that is transmitted as much as it is precise 
instructions. Because these instructions require little data 
to cause very large changes in the final output, a data 
reduction is possible. This synthesizing machine is 
programmable in waveform, frequency, amplitude, and time of 
change of frequency and amplitude. 
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D. SOFTWARE 



Specially developed programs called VOCAB and HUSYS 
allow real-time transformation into computer instructions. 
At this point data reduction eliminates redundant speech 
parts and keeps the meaningful parts. For example, if a 
spoken word is drawn out it is not necessary to continue the 
sound every instant until it is complete; rather, it is only 
necessary for a computer instruction to say "continue this 
sound at this rate for a certain period". The VOCOM 
capability cf storing, mixing, and continuing sounds 
increases the security of voice communications. Other 
reductions pick out peaks in the data, calculate variations 
in the frequency and amplitude and identify the sound source 
such as a telephone. Telephone identification offers 
further reduction since only the bandwidth of the source is 
required. Data reduction eliminates normal pauses and gaps 
in speech. As an example of the instructions: 

"do nothing for .23 seconds." 

"keep on going like you are for .11 seconds." 

"Change the freguency 230 hz over .13 seconds at rate X 
until silence." 

"It is the end of a sentence, drop the overall pitch for 
.10 seconds at rate 4." [30] 

The digital data between the receiver and transmitter 
can be transmitted at a rate of less than 1000 bits per 
second for speech and higher if it becomes necessary. It is 
estimated that telephone speech can be transmitted at 200 
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bits per second, representing a data reduction when 
tra nsmitting . 

To reiterate pertinent points; a digital computer is 
able to program 64 oscillators, each capable of producing 
three periodic waveforms at any amplitude. In theory the 
system is capable of reproducing any sound. Amplitude and 
frequency change is separately defined for each oscillator. 
A crystal clock can communicate to and from the computer 
giving interrupts at appropriate programmable intervals. 
Three output amplifiers can be digitally controlled for 
overall dynamic changes. [29J 



E. COST CONSIDERATIONS 



This system is not only reliable but also inexpensive. 
Hardware prices quoted by Digital Corporation, the 
manufacturers of the PD?8 minicomputer [3, 13] indicate that 
the hardware can be purchased for approximately $40,000. the 
figures listed below are within ten percent accuracy. 



PDP8 Computer 
Memory Box 
Disc File 

Paper Tape reader/Punch 
LA 36 Terminal 
Cabinet 

Bootstrap Loader 
9 Track Magnetic Tape Unit 
Crystal Clock 



$7,600 (2 required) 
$5,000 

$3,950 (2 required) 

$4,200 

$2, 175 

$850 

$ 500 

$11,500 

$ 400 
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15 Channel Digital/Analog Converters $1,000 (2 required) 

Total $39,125 [3, 13] 

Compared to hardware components of another system such as an 
equivalent lE^i 360 system to accomplish the same job, the 
cost to purchase the system is estimated to be $200,000. 
[20] Although the IBM 360 has a greater capability than the 
PDP8, this comparison is meaningful since large computers 
are used for similar applications. Use of other 
minicomputers could yield similar results of the PDP8. 

Some hardware components differ significantly in price 
and would seem to be unjustified until further inspection is 
made. For example, in an analog system, an analog multiplier 
would cost approximately $15 while in a digital system a 

digital multiplier would cost about $125; an important 

advantage exists in the use of digital multipliers in that 
they can be time-shared among the 64 oscillators at a cost 
savings of 600%. Additionally, the steadily decreasing costs 
of large scale integration devices makes digital equipment 
increasingly attractive. [6] 

The overall system offers enormous savings in the 

channel capacity needed for transmitting voice signals for a 
fractional increase in the cost of the terminal equipment. 
For assessing costs and benefits the following factors, 
using a single line, must be considered: 

V=CxlxF 

C cost of line per mile 

L length of line in miles 

V VOCCM terminal cost 
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F compression factor 



F. HYPOTHETICAL APPLICATION 

Currently, normal 3000 hertz voice grade telephone lines in 
the United States cost S5.48 per month per mile. [22] These 
lines are unconditioned , private, leased, and capable of 
full duplex operation. They are also capable of carrying 
either voice or data signals and are single channel. On 
short lines, of, say, less than 100 miles, the cost of 
terminal equipment dominates while on long lines, the lines 
themselves determine the costs. To illustrate an example, a 
single line from the Naval Postgraduate School to 
Washington, C.C., a distance of approximately 3000 miles, 
would cost $5.48 X 3000 or $16,440 per month. Multiplying 
this by 13, the number of autovon lines at the school, the 
cost goes to $213,720. It is possible, through use of the 
VOCOM system, to use one single existing line and through 
multiplexing, still have the equivalent of 13 full duplex 
lines. This cost savings in line usage alone would amount 
to $197,280. In actual practice the school only pays for the 
terminal equipment at the switchboard and the major expense 
for the autovcn is borne by COMNAVTELCOM. 

The autovon lines are actually switched at a switching 
center at Lodi California, a distance of approximately 76 
air miles. The thirteen lines going to Lodi cost monthly 
$5,460, while if only one line were used the cost would be 
only $420, a significant savings. Thirteen channels could 
conceivably, by using the VOCOM system, reduce the line 
costs at a savings of $5,040 per month. One VOCOM system 
costs approximately $50,000 and would, through annual 
savings, pay for itself in less than one year. 
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More importantly is the fact that the secure voice 
feature could be added at a relatively inexpensive price. 
Measuring the actual value of the secure voice capability is 
difficult since the urgency of the needed information is 
subjective and requires higher level decision-making before 
action can be taken. The value therefore cannot be expressed 
in dollars but rather in time savings. In this example 
classified information could be exchanged between personnel 
from the Naval Postgraduate School and personnel from 
Washington D. C. in real-time as opposed to the typically 
delayed two or more weeks which it takes to classify, and 
mail the information. 

A simple, single line user serves to show potential 
savings on line costs alone. A single line is capable of 
handling 4,800 bits per second. [10] By reducing the amount 
of data required for voice to 200 bits per second, the 
potential for 24 channels exists. Line conditioning, a 
modification to lines allowing increased data handling 
capability, increases further the handling capacity of this 
system. As an example, a 96 channel VOCOM unit could be put 
in to service with the new 96 channel Bell D2 channel bank 
enabling the equipment to operate on a single T1 line v/hich 
otherwise has a capacity of only 24 channels. This immediate 
four-fold boost to line profitability could be further 
increased up to 100 times the previous traffic depending on 
voice fidelity and time allocation requirements. 
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1975 and projected 1980 costs of the VOCOfI systems 
including hardware, software, [25] and other direct costs 
are illustrated below. 



1975 


1 


25 


1000 




system 


system s 


sy stems 


100 lines 


$6,250 


$2,500 


$1,250 


10 lines 


$15, 000 


$6,250 


$3,750 


1 line 


$50,000 


$25,000 


$25,000 


1980 








100 lines 


$3,750 


$1,250 


$750 


10 lines 


$10,000 


$3,750 


$2,500 


1 line 


$37,500 


$15,000 


$10,000 



These figures are for use with trunk telephone lines. As 
the number of lines increases, the amount of cost decreases 
significant'/. The system could be used as shown in figure 
3. Figure 4 shows cost/ben ef it' relationships. 
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Figure 3 - TYPICAL SETUP 
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Figure 4 -> COST/3EMEFIT FOR CHAWNEL COMPRESSION 
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Original development and cost of the VOCOfl software is 
derived from the total cost of one system ($75,000) less the 
cost of the hardware ($40,000) or approximately $35,000. 
This sunk cost in technology and development will be reduced 
as the number of systems increases. 



G. SEGUEING IHE SYSTEM 

There exist today numerous voice coding systems. Various 
modifications to modulation techniques incorporate some 
degree of bandwidth compression in the encoding process. 
However, no digital coding scheme satisfactorily encodes 
speech at less than 19.2 kilobits per second; thus, to 
achieve a 9.6 kilobits per second (maximum amount of data 
handling capacity on present lines) voice digitizer, an 
additional bandwidth compression of at least 2:1 must first 
take place. [8] These systems obviously require v/ider 
bandwidth parameters than are necessary with the VOCOH 
system. They also attempt to reproduce all the elements 
(accent, drawl, emotion) unique to the speaker but are still 
not successful at exact duplication. 

Instead of reproducing all these elements, the VOCOM 
system recreates and produces a sound by using coded data of 
much reduced density. The sound is not a reproduced exact 
sound of the speaker but a synthesized recreation of it 
using sophistica ted sound generators. For the individual 
user, a terminal box must be available which is simply a 
small portable box similar to an electronic calculator which 
will attach to a normal telephone handset. This terminal is 
forecasted to cost approximately $30 but it is expected that 
the user will rent the terminal along with other user 
services such as a supply of confidential code numbers and 
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directories for a particular user or group of users. By 
keying specified code numbers on the terminal box the user 
gains access to the VOCOd system and can speak through his 
terminal box in a secure fashion. The confidential code can 
ensure complete security by both changing codes at regularly 
scheduled intervals (hourly, daily, randomly, etc.) and can 
further be scrambled by changing codes automatically. In the 
VOCOd system any of the methods described in the 
introduction could be used. It has the further advantage of 
"multi-scrambling " from user to user. [27] It would be 
possible for example to have a conference call with several 
people talking and none of them receiving the same exact bit 
stream because of their own personal codes. 

Software such as clumsy and bulky keylists which 
currently exist in the military could be reduced bcth in 
size and in complexity. Each authorized command or person 
could receive monthly codes instead of programming keystream 
generators or crypto boards. He would use his code for the 
particular period, punch it into the terminal, and commence 
communicating securely. A Naval operational commander (or 
anyone with proper clearance and need to know) could get 
real-time resolutions to problems instead of waiting for 
misunderstandings in messages, incomplete or delayed 
messages, and mistakes in messages. 
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IV 



CONCLUSIONS 



The need for secure voice communications is ever 
increasing both in the military and in civil circles. More 
importantly, the need to acquire this capability while at 
the same time reducing costs and improving performance is 
highly desirable. Present systems in use today are for the 
most part analog systems which require large usage of 
bandwidth, expensive equipment, and cumbersome, bulky, and 
awkward software support. Large bandwidth usage is 
virtually synonymous with large costs. 

In the military there is a groat need for secure voice 
communications and prohibitive costs limit the number of 
available secure voice terminals. Both voice quality 
improvement and quantity increases are necessary in today’s 
military forces. Most presently used systems operate in the 
electromagnetic spectrum above the HF range and as a 
consequence, much communications takes place on uncovered 
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same time increasing performance. Because of the 
advantages, digital techniques will be employed in the 
future for voice communications. The system proposed here 
has numerous advantages because of digital techniques and 
the potential for changing the whole method of 
communicating. The most difficult problem which remains is 
transferring modern technology and implementing these 
techniques. The proposed system is readily adaptable to 
both presently used analog lines as well as modern. 
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high data lines 



conditioned , 

Costs of the proposed system can be reduced over a 
period of time and with increased numbers of the systems, 
costs continue to decrease. Potential savings in line usage 
alcne have been discussed and it has been shown how the 
system could conceivably pay for itself over time. 

The system is already in itself secure. Communications 
security procedures and publication handling could be 
designed to integrate with present procedures. An additional 
advantage exists in that current software requirements for 
ceding, keylists, etc., could be reduced significantly. 

Presently used voice coding systems have a 
synchronization process which they go through prior to 
establishing communications. With the proposed system, 
synchronous linking would be established at the patch panel 
either manually, or through previously programmed methods. 

Bandwidth is a technical resource which must be 
conserved. This proposed system offers a method of 
conserving bandwidth by increasing the efficiency of its 
usage. 

For military applications there exist some apparent 
disadvantages. Host important is in the oscillators 
themselves. Typically, oscillators tend to drift in 
frequency and tuning accuracy is difficult. Maintenance of 
properly tuned and stabilized oscillators may be an 
exj-ensive and unforseen cost which could affect the overall 
system significantly. 

Duplicate systems would have to be available in critical 
communications links. If only one system were in operation 
and the system became disabled because of loss of oscillator 
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frequency stabilization or for any other reason, a back-up 
system would be required thus further increasing costs. 

A further disadvantage exists in that an alarm system to 
wain the user if he is actually talking in a secure fashion 
would be required. As the system exists, there is no method 
of determining whether the equipment is in fact operating in 
either a secure or an unsecure mode. 
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V. 



RECOMMEUDATIONS 



A continuing investigation of this proposed voice coding 
system is needed. Application to telephone lines have bean 
discussed cut there remains potentially dollar and bandwidth 
savings to be realized in HP, UHF, satellite, and even in LF 
and VLF applications. Use of the proposed system in these 
areas requires further investigation. 

The proposed system should be officially investigated by 
the Navy in an in depth feasibility study. To ease the 
complications of communicating with Electronic Music Studios 
in London, it would be beneficial for some military unit 
there (not necessarily limited to the Navy) to make an 
on-site investigation and study. The Office of Naval 
Research londcn would be the prime candidate. 

Since the system has proven itself in the civilian 
arena, it should be demonstrated and applied using military 
peripheral equipment. 

• The use of a programmable, changing and flexible random 
code changing device needs to be investigated further. 

Testing of privacy and intelligibility, flexibility and 
security must be coordinated with the National Security 
Agency tc find if standard requirements can be met. 
Reliability tests should be included. 



further design modifications should be considered for 
the use of microcomputers and other large scale integration 
devices for potential use as portable equipment in field 



operations. 



It is realized that Electronics Music Studios literature 
has provided a large portion of the data for this study and 
in that context it may be somewhat biased. The United 
States spends annually millions of dollars [2] in research 
and development of communications security equipment; it is 
hoped that some of that could be invested in this system. 
In any event, the initial equipment exists and secure voice 
communication with bandwidth reduction has been realized. 
It is the hope of this author that future investigation and 
Waval interest will lead to a complete, or at minimum a 
partial operational network. 
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